Overview
Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 1000 |
| Missing cells | 67 |
| Missing cells (%) | 0.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 126.0 KiB |
| Average record size in memory | 129.1 B |
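The missing-cell percentage follows directly from the counts in this table: 67 missing cells out of 11 × 1000 cells. A minimal sketch of that arithmetic (the variable names are illustrative and not taken from ydata-profiling):

```python
# Values copied from the "Dataset statistics" table.
n_variables = 11
n_observations = 1000
missing_cells = 67

total_cells = n_variables * n_observations        # 11 * 1000 = 11000
missing_pct = 100 * missing_cells / total_cells   # about 0.609

print(f"{missing_pct:.1f}%")
```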
Variable types
| Categorical | 7 |
|---|---|
| Numeric | 4 |
Alerts
| Alert | Type |
|---|---|
| Age is highly overall correlated with id_student | High correlation |
| Year is highly overall correlated with id_student | High correlation |
| gender is highly overall correlated with id_student | High correlation |
| id_student is highly overall correlated with Age and 6 other fields | High correlation |
| lunch is highly overall correlated with id_student | High correlation |
| math score is highly overall correlated with reading score and 1 other field | High correlation |
| parental level of education is highly overall correlated with id_student | High correlation |
| race/ethnicity is highly overall correlated with id_student | High correlation |
| reading score is highly overall correlated with math score and 1 other field | High correlation |
| test preparation course is highly overall correlated with id_student | High correlation |
| writing score is highly overall correlated with math score and 1 other field | High correlation |
| Year is highly imbalanced (68.1%) | Imbalance |
| Age has 67 (6.7%) missing values | Missing |
| id_student is uniformly distributed | Uniform |
| id_student has unique values | Unique |
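The 68.1% imbalance figure for Year is consistent with an entropy-based score, 1 − H(p)/log2(k), applied to the two Year counts given in the Year section of this report (942 and 58). The sketch below reproduces the number under the assumption that this is the formula behind the alert; `imbalance_score` is a hypothetical helper name, not a ydata-profiling function.

```python
import math

def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for class counts.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    Assumed formula; hypothetical helper, not a ydata-profiling API.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

year_counts = [942, 58]  # counts for 2023 and 1990
score = imbalance_score(year_counts)
print(f"{score:.1%}")
```

A perfectly balanced two-class column scores 0 under this definition, so a value near 0.68 indicates that one year dominates the column.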
Reproduction
| Analysis started | 2025-03-16 21:58:33.160409 |
|---|---|
| Analysis finished | 2025-03-16 21:58:46.292084 |
| Duration | 13.13 seconds |
| Software version | ydata-profiling v4.14.0 |
| Download configuration | config.json |
Variables
gender
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 47.9 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.966 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MALE |
|---|---|
| 2nd row | FEMALE |
| 3rd row | MALE |
| 4th row | MALE |
| 5th row | MALE |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| MALE | 517 | 51.7% |
| FEMALE | 483 | 48.3% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| male | 517 | 51.7% |
| female | 483 | 48.3% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| E | 1483 | 29.9% |
| M | 1000 | 20.1% |
| A | 1000 | 20.1% |
| L | 1000 | 20.1% |
| F | 483 | 9.7% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 4966 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 4966 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 4966 | 100.0% |
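The length and character statistics for gender can be re-derived from the two value counts alone. A small sketch, using only the counts reported for this variable (517 MALE, 483 FEMALE):

```python
from collections import Counter

value_counts = {"MALE": 517, "FEMALE": 483}  # from the Common Values table

# Count every character, weighted by how often its value occurs.
char_counts = Counter()
for value, count in value_counts.items():
    for ch in value:
        char_counts[ch] += count

total_chars = sum(char_counts.values())
mean_length = total_chars / sum(value_counts.values())

print(char_counts.most_common())
print(total_chars, mean_length)
```

The totals agree with the report: 4966 characters overall, a mean length of 4.966, and 1483 occurrences of E (one per MALE plus two per FEMALE).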
race/ethnicity
Categorical
High correlation 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 47.9 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | group A |
|---|---|
| 2nd row | group D |
| 3rd row | group E |
| 4th row | group B |
| 5th row | group E |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| group C | 323 | 32.3% |
| group D | 262 | 26.2% |
| group B | 205 | 20.5% |
| group E | 131 | 13.1% |
| group A | 79 | 7.9% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| group | 1000 | 50.0% |
| c | 323 | 16.2% |
| d | 262 | 13.1% |
| b | 205 | 10.2% |
| e | 131 | 6.6% |
| a | 79 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| g | 1000 | 14.3% |
| r | 1000 | 14.3% |
| o | 1000 | 14.3% |
| u | 1000 | 14.3% |
| p | 1000 | 14.3% |
| (space) | 1000 | 14.3% |
| C | 323 | 4.6% |
| D | 262 | 3.7% |
| B | 205 | 2.9% |
| E | 131 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 7000 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 7000 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 7000 | 100.0% |
parental level of education
Categorical
High correlation 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 47.9 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 14.55 |
| Min length | 11 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | high school |
|---|---|
| 2nd row | some high school |
| 3rd row | some college |
| 4th row | high school |
| 5th row | associate's degree |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| some college | 222 | 22.2% |
| associate's degree | 203 | 20.3% |
| high school | 202 | 20.2% |
| some high school | 191 | 19.1% |
| bachelor's degree | 112 | 11.2% |
| master's degree | 70 | 7.0% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| some | 413 | 18.8% |
| high | 393 | 17.9% |
| school | 393 | 17.9% |
| degree | 385 | 17.6% |
| college | 222 | 10.1% |
| associate's | 203 | 9.3% |
| bachelor's | 112 | 5.1% |
| master's | 70 | 3.2% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| e | 2397 | 16.5% |
| o | 1736 | 11.9% |
| s | 1667 | 11.5% |
| h | 1291 | 8.9% |
| (space) | 1191 | 8.2% |
| g | 1000 | 6.9% |
| l | 949 | 6.5% |
| c | 930 | 6.4% |
| i | 596 | 4.1% |
| a | 588 | 4.0% |
| Other values (6) | 2205 | 15.2% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 14550 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 14550 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 14550 | 100.0% |
lunch
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 47.9 KiB |
Length
| Max length | 12 |
|---|---|
| Median length | 8 |
| Mean length | 9.392 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | standard |
|---|---|
| 2nd row | free/reduced |
| 3rd row | free/reduced |
| 4th row | standard |
| 5th row | standard |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| standard | 652 | 65.2% |
| free/reduced | 348 | 34.8% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| standard | 652 | 65.2% |
| free/reduced | 348 | 34.8% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| d | 2000 | 21.3% |
| e | 1392 | 14.8% |
| r | 1348 | 14.4% |
| a | 1304 | 13.9% |
| s | 652 | 6.9% |
| t | 652 | 6.9% |
| n | 652 | 6.9% |
| f | 348 | 3.7% |
| / | 348 | 3.7% |
| u | 348 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 9392 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 9392 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 9392 | 100.0% |
test preparation course
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 47.9 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 4 |
| Mean length | 5.675 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | completed |
|---|---|
| 2nd row | none |
| 3rd row | none |
| 4th row | none |
| 5th row | completed |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| none | 665 | 66.5% |
| completed | 335 | 33.5% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| none | 665 | 66.5% |
| completed | 335 | 33.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| e | 1335 | 23.5% |
| n | 1330 | 23.4% |
| o | 1000 | 17.6% |
| c | 335 | 5.9% |
| m | 335 | 5.9% |
| p | 335 | 5.9% |
| l | 335 | 5.9% |
| t | 335 | 5.9% |
| d | 335 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 5675 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 5675 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 5675 | 100.0% |
math score
Real number (ℝ)
High correlation 
| Distinct | 78 |
|---|---|
| Distinct (%) | 7.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 66.436 |
| Minimum | 13 |
| Maximum | 120 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 40 |
| Q1 | 56 |
| median | 66.5 |
| Q3 | 77 |
| 95-th percentile | 91 |
| Maximum | 120 |
| Range | 107 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 15.489927 |
|---|---|
| Coefficient of variation (CV) | 0.23315563 |
| Kurtosis | -0.14193308 |
| Mean | 66.436 |
| Median Absolute Deviation (MAD) | 10.5 |
| Skewness | -0.11548843 |
| Sum | 66436 |
| Variance | 239.93784 |
| Monotonicity | Not monotonic |
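The derived figures in these tables can be checked from the reported quantiles, and the same quantiles give a quick outlier screen. The sketch below recomputes the range, IQR and coefficient of variation for math score, then applies the conventional 1.5 × IQR (Tukey) fences. The fence rule is a common convention and an assumption here, not something the report itself applies.

```python
# Values copied from the math score statistics.
minimum, q1, q3, maximum = 13, 56, 77, 120
mean, std = 66.436, 15.489927

value_range = maximum - minimum   # 107
iqr = q3 - q1                     # 21
cv = std / mean                   # about 0.2332

# Tukey fences: values outside [q1 - 1.5*IQR, q3 + 1.5*IQR] are flagged.
lower_fence = q1 - 1.5 * iqr      # 24.5
upper_fence = q3 + 1.5 * iqr      # 108.5

print(value_range, iqr, round(cv, 4))
print("minimum flagged:", minimum < lower_fence)
print("maximum flagged:", maximum > upper_fence)
```

Both extremes fall outside the fences. The maximum of 120 may also deserve a check against the valid score range, since the reading and writing scores in this report both top out at 100.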
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 63 | 34 | 3.4% |
| 71 | 30 | 3.0% |
| 77 | 30 | 3.0% |
| 74 | 28 | 2.8% |
| 57 | 27 | 2.7% |
| 66 | 26 | 2.6% |
| 58 | 26 | 2.6% |
| 65 | 25 | 2.5% |
| 70 | 25 | 2.5% |
| 78 | 24 | 2.4% |
| Other values (68) | 725 | 72.5% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 13 | 2 | 0.2% |
| 23 | 1 | 0.1% |
| 25 | 1 | 0.1% |
| 26 | 2 | 0.2% |
| 28 | 2 | 0.2% |
| 29 | 1 | 0.1% |
| 30 | 2 | 0.2% |
| 31 | 2 | 0.2% |
| 32 | 2 | 0.2% |
| 33 | 4 | 0.4% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 120 | 1 | 0.1% |
| 100 | 14 | 1.4% |
| 99 | 3 | 0.3% |
| 98 | 3 | 0.3% |
| 97 | 3 | 0.3% |
| 96 | 3 | 0.3% |
| 95 | 2 | 0.2% |
| 94 | 7 | 0.7% |
| 93 | 5 | 0.5% |
| 92 | 7 | 0.7% |
reading score
Real number (ℝ)
High correlation 
| Distinct | 85 |
|---|---|
| Distinct (%) | 8.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 64.951 |
| Minimum | 15 |
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | 15 |
|---|---|
| 5-th percentile | 29 |
| Q1 | 54 |
| median | 68 |
| Q3 | 78 |
| 95-th percentile | 93 |
| Maximum | 100 |
| Range | 85 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 19.010049 |
|---|---|
| Coefficient of variation (CV) | 0.29268294 |
| Kurtosis | -0.41685377 |
| Mean | 64.951 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -0.47058794 |
| Sum | 64951 |
| Variance | 361.38198 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 63 | 32 | 3.2% |
| 71 | 29 | 2.9% |
| 73 | 29 | 2.9% |
| 64 | 28 | 2.8% |
| 78 | 27 | 2.7% |
| 76 | 25 | 2.5% |
| 72 | 23 | 2.3% |
| 70 | 23 | 2.3% |
| 87 | 23 | 2.3% |
| 62 | 23 | 2.3% |
| Other values (75) | 738 | 73.8% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 15 | 1 | 0.1% |
| 17 | 3 | 0.3% |
| 18 | 1 | 0.1% |
| 19 | 5 | 0.5% |
| 20 | 1 | 0.1% |
| 21 | 1 | 0.1% |
| 22 | 1 | 0.1% |
| 23 | 4 | 0.4% |
| 24 | 6 | 0.6% |
| 25 | 7 | 0.7% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 100 | 19 | 1.9% |
| 99 | 2 | 0.2% |
| 98 | 2 | 0.2% |
| 97 | 3 | 0.3% |
| 96 | 4 | 0.4% |
| 95 | 9 | 0.9% |
| 94 | 4 | 0.4% |
| 93 | 8 | 0.8% |
| 92 | 7 | 0.7% |
| 91 | 10 | 1.0% |
writing score
Real number (ℝ)
High correlation 
| Distinct | 57 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 70.339 |
| Minimum | 23 |
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | 23 |
|---|---|
| 5-th percentile | 42 |
| Q1 | 58 |
| median | 68 |
| Q3 | 79 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 77 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 19.160094 |
|---|---|
| Coefficient of variation (CV) | 0.27239645 |
| Kurtosis | -0.72643607 |
| Mean | 70.339 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 0.22996218 |
| Sum | 70339 |
| Variance | 367.10919 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 100 | 225 | 22.5% |
| 64 | 34 | 3.4% |
| 71 | 34 | 3.4% |
| 66 | 30 | 3.0% |
| 70 | 29 | 2.9% |
| 65 | 29 | 2.9% |
| 73 | 29 | 2.9% |
| 60 | 27 | 2.7% |
| 68 | 25 | 2.5% |
| 63 | 24 | 2.4% |
| Other values (47) | 514 | 51.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 23 | 2 | 0.2% |
| 24 | 1 | 0.1% |
| 26 | 1 | 0.1% |
| 27 | 2 | 0.2% |
| 28 | 2 | 0.2% |
| 30 | 1 | 0.1% |
| 31 | 2 | 0.2% |
| 32 | 3 | 0.3% |
| 33 | 4 | 0.4% |
| 34 | 1 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 100 | 225 | 22.5% |
| 80 | 17 | 1.7% |
| 79 | 17 | 1.7% |
| 78 | 18 | 1.8% |
| 77 | 17 | 1.7% |
| 76 | 22 | 2.2% |
| 75 | 20 | 2.0% |
| 74 | 17 | 1.7% |
| 73 | 29 | 2.9% |
| 72 | 23 | 2.3% |
id_student
Real number (ℝ)
High correlation  Uniform  Unique 
| Distinct | 1000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1499.5 |
| Minimum | 1000 |
| Maximum | 1999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 1049.95 |
| Q1 | 1249.75 |
| median | 1499.5 |
| Q3 | 1749.25 |
| 95-th percentile | 1949.05 |
| Maximum | 1999 |
| Range | 999 |
| Interquartile range (IQR) | 499.5 |
Descriptive statistics
| Standard deviation | 288.81944 |
|---|---|
| Coefficient of variation (CV) | 0.19261049 |
| Kurtosis | -1.2 |
| Mean | 1499.5 |
| Median Absolute Deviation (MAD) | 250 |
| Skewness | 0 |
| Sum | 1499500 |
| Variance | 83416.667 |
| Monotonicity | Strictly increasing |
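The id_student statistics are what 1000 consecutive integers would produce, which fits the Uniform and Unique alerts. A short check, assuming the IDs run from 1000 to 1999 without gaps (consistent with the minimum, maximum, distinct count and strictly increasing order reported here) and that the reported variance uses the sample (n − 1) denominator:

```python
import math

ids = list(range(1000, 2000))  # assumed: consecutive IDs 1000..1999
n = len(ids)

mean = sum(ids) / n
# Sample variance (n - 1 denominator); for consecutive integers this equals n * (n + 1) / 12.
variance = sum((x - mean) ** 2 for x in ids) / (n - 1)
std = math.sqrt(variance)

print(n, sum(ids), mean)
print(round(variance, 3), round(std, 5))
```

Under these assumptions the sum (1499500), mean (1499.5), variance and standard deviation line up with the table, so the column behaves like a row index rather than a measurement.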
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1000 | 1 | 0.1% |
| 1671 | 1 | 0.1% |
| 1658 | 1 | 0.1% |
| 1659 | 1 | 0.1% |
| 1660 | 1 | 0.1% |
| 1661 | 1 | 0.1% |
| 1662 | 1 | 0.1% |
| 1663 | 1 | 0.1% |
| 1664 | 1 | 0.1% |
| 1665 | 1 | 0.1% |
| Other values (990) | 990 | 99.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1000 | 1 | 0.1% |
| 1001 | 1 | 0.1% |
| 1002 | 1 | 0.1% |
| 1003 | 1 | 0.1% |
| 1004 | 1 | 0.1% |
| 1005 | 1 | 0.1% |
| 1006 | 1 | 0.1% |
| 1007 | 1 | 0.1% |
| 1008 | 1 | 0.1% |
| 1009 | 1 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1999 | 1 | 0.1% |
| 1998 | 1 | 0.1% |
| 1997 | 1 | 0.1% |
| 1996 | 1 | 0.1% |
| 1995 | 1 | 0.1% |
| 1994 | 1 | 0.1% |
| 1993 | 1 | 0.1% |
| 1992 | 1 | 0.1% |
| 1991 | 1 | 0.1% |
| 1990 | 1 | 0.1% |
Year
Categorical
High correlation  Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 47.9 KiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2023 |
|---|---|
| 2nd row | 2023 |
| 3rd row | 2023 |
| 4th row | 2023 |
| 5th row | 2023 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2023 | 942 | 94.2% |
| 1990 | 58 | 5.8% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| 2023 | 942 | 94.2% |
| 1990 | 58 | 5.8% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1884 | 47.1% |
| 0 | 1000 | 25.0% |
| 3 | 942 | 23.5% |
| 9 | 116 | 2.9% |
| 1 | 58 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 4000 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 4000 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 4000 | 100.0% |
Age
Categorical
High correlation  Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 67 |
| Missing (%) | 6.7% |
| Memory size | 47.9 KiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 14.0 |
|---|---|
| 2nd row | 17.0 |
| 3rd row | 14.0 |
| 4th row | 17.0 |
| 5th row | 16.0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 14.0 | 259 | 25.9% |
| 17.0 | 242 | 24.2% |
| 16.0 | 225 | 22.5% |
| 15.0 | 207 | 20.7% |
| (Missing) | 67 | 6.7% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| 14.0 | 259 | 27.8% |
| 17.0 | 242 | 25.9% |
| 16.0 | 225 | 24.1% |
| 15.0 | 207 | 22.2% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 933 | 25.0% |
| . | 933 | 25.0% |
| 0 | 933 | 25.0% |
| 4 | 259 | 6.9% |
| 7 | 242 | 6.5% |
| 6 | 225 | 6.0% |
| 5 | 207 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 3732 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 3732 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 3732 | 100.0% |
Interactions
Correlations
| | Age | Year | gender | id_student | lunch | math score | parental level of education | race/ethnicity | reading score | test preparation course | writing score |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Age | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.047 | 0.061 | 0.049 | 0.038 | 0.029 | 0.000 |
| Year | 0.000 | 1.000 | 0.035 | 1.000 | 0.000 | 0.000 | 0.013 | 0.000 | 0.024 | 0.033 | 0.056 |
| gender | 0.000 | 0.035 | 1.000 | 1.000 | 0.004 | 0.207 | 0.107 | 0.066 | 0.172 | 0.000 | 0.225 |
| id_student | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.026 | 1.000 | 1.000 | 0.021 | 1.000 | 0.011 |
| lunch | 0.000 | 0.000 | 0.004 | 1.000 | 1.000 | 0.361 | 0.000 | 0.072 | 0.266 | 0.000 | 0.309 |
| math score | 0.047 | 0.000 | 0.207 | 0.026 | 0.361 | 1.000 | 0.100 | 0.133 | 0.720 | 0.152 | 0.792 |
| parental level of education | 0.061 | 0.013 | 0.107 | 1.000 | 0.000 | 0.100 | 1.000 | 0.000 | 0.309 | 0.034 | 0.119 |
| race/ethnicity | 0.049 | 0.000 | 0.066 | 1.000 | 0.072 | 0.133 | 0.000 | 1.000 | 0.073 | 0.000 | 0.102 |
| reading score | 0.038 | 0.024 | 0.172 | 0.021 | 0.266 | 0.720 | 0.309 | 0.073 | 1.000 | 0.331 | 0.841 |
| test preparation course | 0.029 | 0.033 | 0.000 | 1.000 | 0.000 | 0.152 | 0.034 | 0.000 | 0.331 | 1.000 | 0.295 |
| writing score | 0.000 | 0.056 | 0.225 | 0.011 | 0.309 | 0.792 | 0.119 | 0.102 | 0.841 | 0.295 | 1.000 |
Missing values
Sample
First rows
| | gender | race/ethnicity | parental level of education | lunch | test preparation course | math score | reading score | writing score | id_student | Year | Age |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | MALE | group A | high school | standard | completed | 67 | 67 | 63 | 1000 | 2023 | 14.0 |
| 1 | FEMALE | group D | some high school | free/reduced | none | 40 | 29 | 55 | 1001 | 2023 | 17.0 |
| 2 | MALE | group E | some college | free/reduced | none | 59 | 60 | 50 | 1002 | 2023 | 14.0 |
| 3 | MALE | group B | high school | standard | none | 77 | 78 | 68 | 1003 | 2023 | 17.0 |
| 4 | MALE | group E | associate's degree | standard | completed | 78 | 73 | 68 | 1004 | 2023 | 16.0 |
| 5 | FEMALE | group D | high school | standard | none | 63 | 77 | 76 | 1005 | 2023 | 16.0 |
| 6 | FEMALE | group A | bachelor's degree | standard | none | 62 | 59 | 63 | 1006 | 2023 | 14.0 |
| 7 | MALE | group E | some college | standard | completed | 93 | 88 | 100 | 1007 | 2023 | 17.0 |
| 8 | MALE | group D | high school | standard | none | 63 | 56 | 65 | 1008 | 2023 | 15.0 |
| 9 | MALE | group C | some college | free/reduced | none | 47 | 42 | 45 | 1009 | 2023 | 16.0 |
Last rows
| | gender | race/ethnicity | parental level of education | lunch | test preparation course | math score | reading score | writing score | id_student | Year | Age |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 990 | MALE | group D | some college | standard | none | 67 | 55 | 53 | 1990 | 2023 | 16.0 |
| 991 | FEMALE | group C | associate's degree | standard | none | 87 | 93 | 100 | 1991 | 2023 | 17.0 |
| 992 | MALE | group C | some college | standard | none | 69 | 63 | 66 | 1992 | 2023 | 15.0 |
| 993 | FEMALE | group A | associate's degree | standard | none | 58 | 54 | 58 | 1993 | 2023 | 15.0 |
| 994 | MALE | group E | high school | free/reduced | completed | 86 | 82 | 75 | 1994 | 2023 | 16.0 |
| 995 | MALE | group C | high school | standard | none | 73 | 70 | 65 | 1995 | 1990 | 15.0 |
| 996 | MALE | group D | associate's degree | free/reduced | completed | 85 | 91 | 100 | 1996 | 2023 | 14.0 |
| 997 | FEMALE | group C | some high school | free/reduced | none | 32 | 17 | 41 | 1997 | 2023 | 17.0 |
| 998 | FEMALE | group C | some college | standard | none | 73 | 74 | 100 | 1998 | 2023 | 17.0 |
| 999 | MALE | group A | some college | standard | completed | 65 | 60 | 62 | 1999 | 2023 | 15.0 |